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Abstract. We analyze the natural process of flipping a coin which is caught in the hand. We show 
that vigorously flipped coins tend to come up the same way they started. The limiting 
chance of coming up this way depends on a single parameter, the angle between the normal 
to the coin and the angular momentum vector. Measurements of this parameter based on 
high-speed photography are reported. For natural flips, the chance of coming up as started 
is about .51. 
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I. Introduction. Coin tossing is a basic example of a random phenomenon. How- 
ever, naturally tossed coins obey the laws of mechanics (we neglect air resistance) and 
their flight is determined by their initial conditions. Figures 1(a)—(d) show a coin 
tossing machine. The coin is placed on a spring, the spring is released by a ratchet, 
and the coin flips up doing a natural spin and lands in the cup. With careful adjust- 
ment, the coin started heads up always lands heads up—one hundred percent of the 
time. We conclude that coin tossing is “physics” not “random.” 

Joe Keller [20] carried out a study of the physics assuming that the coin spins 
about an axis through its plane. Then, the initial upward velocity and the rate of spin 
determine the final outcome. Keller showed that in the limit of large initial velocity 
and large rate of spin, a vigorous flip, caught in the hand without bouncing, lands 
heads up half the time. This work is described more carefully in section 2 which 
contains a literature review of previous work on tossed and spinning coins. 

The present paper takes precession into account. We will use the term precession 
to indicate that the direction of the axis of rotation changes as the coin goes through 
its trajectory (see Figure 2(a)). Real flips often precess a fair amount and this changes 
the conclusion. Consider first a coin starting heads up and hit exactly in the center 
so it goes up without turning like a spinning pizza. We call such a flip a “total cheat 
coin,” because it always comes up the way it started. For such a toss, the angular 
momentum vector M lies along the normal to the coin, and there is no precession. 
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Fig. | 


003 
(a) (b) 


Fig.2 (a) Diagram of a precessing coin. (b) Coordinates of precessing coin: K is the upward 
direction, 7 is the normal to the coin, M is the angular momentum vector, and w3 is the 
rate of rotation around the normal ñ. 
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In section 3 we show that the angle ~ between M and the normal ñ to the 
coin stays constant. If this angle is less than 45°, the coin never turns over. It 
wobbles around and always comes up the way it started. In all of these cases there is 
precession. Magicians and gamblers can carry out such controlled flips which appear 
visually indistinguishable from normal flips [24]. 

For Keller’s analysis, M is assumed to lie in the plane of the coin with an angle 
of 90° to the normal to the coin; again there is no precession. We now state our 
main theorems. The various coordinate vectors are shown in Figure 2(b). Complete 
notational details are in section 3.1. We use capital letters for the laboratory frame 
and lowercase letters for the body-centered frame. In particular, ñ is the normal from 
the point of view of the coin and N(t) is the normal from the point of view of the 
observer at time t. At time zero, N(t) = N(0) = ñ. 

THEOREM 1. For a coin tossed starting heads up at time 0, let r(t) = N(t) . K 
be the cosine of the angle between the normal at time t and the up direction K. Then 


(1.1) T(t) =A+B cos(wyt), 


with A = cos? 4, B = sin? 4, wy = ||M||/i, = 4(mR? + imh?) for coins with ra- 
dius R, thickness h, and mass m. Here w is the angle between the angular momentum 
vector M and the normal at time t = 0, and ||- || is the usual Euclidean norm. 

Theorem 1 gives a simple formula for the relevant position of the coin as a function 
of the initial conditions. As shown below, the derived parameter wy will be large for 
vigorously flipped coins. To apply Theorem 1, consider any smooth probability density 
g on the initial conditions (wy,t) of Theorem 1. Keep w as a free parameter. We 
suppose g to be centered at (wo, to) so that the resulting density can be written in 
the form glwy — wo,t — to). Let (wo,to) tend to infinity along a ray in the positive 
orthant wy > 0,t > 0, corresponding to large spin and large time-of-flight. 

THEOREM 2. For all smooth, compactly supported densities g, as the product woto 
tends to infinity, the limiting probability of heads p(w) with y fixed, given that heads 
starts up, is 


(1.2) pl) = t + zsnt (cot?(Y)) if Ẹ <% < 37/4, 


1 ifO0<p<a/4or FZ cypK<r. 


A graph of p(w) appears in Figure 3. Observe that p(w) is always greater than 
or equal to 1/2 and equals 1/2 only if y = 7/2. In this sense, vigorously tossed coins 
((wo, to) large) are biased to come up as they started, for essentially arbitrary initial 
distributions g. The proof of Theorem 2 gives a quantitative rate of convergence to 
p(w) as wo and to become large. 

We now explain the picture behind Theorem 1 and some heuristics for Theorem 
2 (see Figure 4). While the coin is in flight its angular momentum is constant in 
time and the normal vector precesses around it at a uniform velocity, sweeping out 
a circle on the sphere of unit vectors. (This is proved in section 3.) On this sphere, 
draw the equator of vectors orthogonal to the direction K of “straight up.” Points on 
the equator represent the coin when only its edge can be seen. Points in the upper 
hemisphere H represent the coin “heads up” and points in the lower hemisphere T 
represent the coin “tails up.” H corresponds to T > 0 and T to 7 < 0, where 7 is the 
function of Theorem 1. 

Suppose now that the coin starts its travel precisely heads up, so that the normal 
is aligned with K. Then the normal N traces out a circle on the sphere passing 
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ply) os 


Fig.3 Probability of heads as a function of w. 


Fig.4 The normals to the coin lie on a circle intersecting with the equator of change of sides from 
heads to tails (hatched area). 


through K and having as its center the “random” point M (normalized). It follows 
from Theorem 1 that for all choices of M except for M lying in the equator (the Keller 
flip) more of this circle lies in the H hemisphere than the T. The coin appears biased 
toward heads. 

To obtain a quantitative expression for the bias we fix the angle Y, which is also 
the (spherical) radius of the circle described by the normal. To begin, note that if w 
is between 0 and 7/4, then A > B in (1.1) of Theorem 1 and so 7(t) > 0 for all time 
t. In this case the coin is always heads up and we are in the range where p = 1 in 
Theorem 2. In general, it is plausible that the probability of heads is proportional to 
the amount of time which Ñ spends in the hemisphere H. This proportion of time is 
precisely the probability p(w) of (1.2) in Theorem 2. 
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Fig.5 (a) Variations of the function T as a function of time t for p = 7/2. (b) Variations of the 


function T as a function of time t for p = 7/3. 


o 


Figures 5(a) and 5(b) show the effect of changing w. In Figure 5(a), Y = 5 and 
T of (1.1) is positive half of the time. In Figure 5(b), y = 3 and 7 is more often 
positive. 

Theorems 1 and 2 lead us to ask what the empirical distribution of w is when 
real people toss coins. In section 5 two empirical studies are described. The first is 
low-tech and uses a coin with a thin ribbon attached. The second uses a high-speed 
slow motion camera. The projection of a circle onto the plane of the camera is an 
ellipse. Using image analysis techniques we fit the ellipses to the images of the tossed 
coin. A simple function of the lengths of the major and minor axes gives the normal 
to the coin in three-space. As explained, these normals spin in a circle about the 
angular momentum vector which stays fixed during the coin’s flight. This gives an 
estimate of Y. Two methods of estimation which agree to reasonable approximation 
are given. 

The empirical estimates of w show that naturally flipped coins precess sufficiently 
to force a bias of at least .01. We find it surprising that this bias persists in the limit 
of vigorously flipped coins for general densities g(wy,t). 

The structure of the rest of the paper is as follows. Section 2 reviews previous 
literature and data on coin tossing. Section 3 reviews rigid body motion and proves 
Theorem 1. In section 3 we also derive an exact result for the amount of precession: the 
amount that the coin turns about its normal during one revolution of the normal about 
the angular momentum vector is m cos(w). This “internal rotation” is an example of a 
geometric or Berry phase [29]. The limiting results of Theorem 2 are proved in section 
4. Section 5 presents our data. Section 6 presents some caveats to the analysis along 
with our conclusions. 


2. Previous Literature. The analysis of classical randomization devices using 
mechanics and a distribution on initial conditions goes back to Poincaré’s analysis of 
roulette [32, pp. 122-130]. This was brilliantly continued in a sequence of studies by 
Hopf [15, 16, 17], who studied Buffon’s needle, introduced various mixing conditions 
to prove independence of successive outcomes, and gave examples where the initial 
conditions still influence later states. Hopf began a classification of low order ordinary 
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Fig.6 The hyperbolas separating heads from tails in part of phase space. Initial conditions leading 
to heads are hatched, tails are left white, w is measured in s~!. 


differential equations by sensitivity to initial conditions. While this work is little 
known today, [36] gives some further history, [34] offers a philosopher’s commentary, 
and [5] presents a detailed development with extensions. 

It cannot be emphasized too strongly that the results above are limiting results: 
Poincaré’s arguments suggest that as a roulette ball is spun more and more vigorously 
the numbers become closer and closer to uniformly distributed. There are numerous 
studies (see [1], [2]) suggesting that real roulette may not be vigorous enough to avoid 
the perennity of the initial conditions. 

The careful study of flipped coins was begun by Keller [20], whose analysis we 
briefly sketch here. He assumed that a coin flips about an axis in its plane with spin 
about this axis at rate w revolutions per second. If the initial velocity in the up 
direction K is vz, after t seconds a coin flipped from initial height zp will be at height 
zo + tvz — (g/2)t?. Here g is the acceleration due to gravity (g = 32 ft/(sec)? if height 
is measured in feet). If the coin is caught when it returns to 29, the elapsed time t* 
satisfies zo + t*uvz — (g/2)(t*)? = zo or t* = v./(g/2). The coin will have revolved 
wvz/(g/2) times. If, for some integer j, this number of revolutions is between 27 and 
2j +1, the initial side will be upmost. If the number of revolutions is between 27 + 1 
and 27 +2, the opposite side will be upmost. Figure 6 shows the decomposition of the 
phase space (w, t) into regions where the coin comes up as it started or opposite. The 
edges of the regions are along the hyperbolas wv/(g/2) = j. Visually, the regions get 
close together so small changes in the initial conditions cause the difference between 
heads and tails. 

The spaces between the hyperbolas in Figure 6 have equal area. The horizontal 
axis goes from t = 0.1s to t = 0.8s. The vertical axis goes from w = 0 to w = 1400. 

Asymptotic limits can be avoided by deriving explicit error terms for the approx- 
imations. Here is a theorem from [5], specialized to the coin tossing case. (Here we 
are still in Keller’s model, not in the context of Theorems 1 and 2 above.) Let f(w,t) 
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be a probability density on the (t,w) plane. Thus f(w,t) >Oand f f f(w,t)dwdt = 1. 
The marginal densities fı (w), fo(t) are defined by 


fale) = ffod falt) =f Hondo 
The conditional densities f.,(t) and f;(w) are defined by 


_ f(w,t) _ f(w,t) 
fi(w)’ fa(t) 


Thus f.,(¢) is the probability density which gives the chance that a random quantity 
with density f(w,t) is in the interval (t,t + dt) given an observed, fixed value of w. 
The following theorem applies to the case where Y = 7/2. The translation of f is the 
probability density f(w,t) = f(wo + w, to +t). 

THEOREM 3 (Engel-Kemperman). Let f(w,t) be a probability density on the 
(w,t) plane with marginal densities f\(w), fo(t), and differentiable conditional densi- 
ties fult), fi(w). Translate f to f(wtwo, t+ to). Then, the probability over the 
heads region in Figure 6 (the Keller coin) satisfies 


folt) fil) 


then) — 3] < 1min (Te, Me), 
2 to Wo 


where 
W= ff ie\lAwydoat, Wo = f [EOR dd. 


To see if this theorem is useful for natural coin tosses, we carried out empirical 
measurements similar to those reported in section 5 below. These measurements show 
that natural coin tosses (approximately one foot tosses of duration about 1/2 second) 
have initial conditions that satisfy 


(x) 36 <w < 40, 7< v, <9 equivalent to 0.44 < t < 0.56, 


with w measured in rev/sec, v, in ft/sec, and t in seconds. Assuming a uniform 
distribution on the region defined by (*), the Engel-Kemperman theorem gives 


1 
P(heads) — 5 < .056. 


All of the studies cited above assumed the coin is caught in the hand without 
bouncing. An analysis of the effect of bouncing in coin tossing was suggested by 
Vulovic and Prange [37]. Following Keller, they assumed that the coin rotates about 
an axis through its plane. Thus, the phase space is (w, t) as before. They hypothesized 
an explicit model for inelastic collisions that determines the coin’s eventual resting 
place. The resulting partitioning of (w, t) space is surprisingly similar to Keller’s (see 
Figure 6). The preceding analysis (as illustrated in Figure 6) shows a reasonably fine 
partitioning of the phase space. In these regions, Vulovic and Prange showed that 
bouncing causes a fractal structure to appear in regions far from zero. Then, bouncing 
appreciably enhances randomness. Zeng-Yuan and Bin [38] carried these considera- 
tions forward. They included both bouncing and air resistance. In their model, 
bouncing, a very nonlinear phenomenon, causes very pronounced sensitivity to initial 
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Fig.7 (a) Coins spun on their edges. (b) Tossed coins. 


conditions. They neglected precession and so did not encounter our phenomenon of 
unfair coins in the limit of large (w, t). 

An intriguing analysis of coin tossing appears in [18, section 10.3]. As a physicist, 
Jaynes clearly understands that conservation of angular momentum is the key to the 
analysis of coin tossing. With hindsight, we can find therein our statement following 
Theorem 1, that if the angle w is sufficiently acute, then the coin remains same side 
up throughout its trajectory. This result was also described to us by Toomre [35]. 
Jaynes discussed weighted coins and coins spun on the edge. He included some data, 
in which a jar lid is tossed or spun in various ways and extreme variations from the 
fair case of 50:50 ensue. 

We turn next to a different method of randomizing: coins spun on their edge. 
Here, the situation is much changed. Spun coins can exhibit huge variations from 
the fair probability situation. The exact determination of the probability depends 
in a delicate way on the shape of the coin’s edge and the exact center of gravity. 
Indeed, magicians use coins with slightly shaved edges, invisible to the naked eye, 
which always come up heads. While we will not pursue the details, we offer Figure 7 
as evidence. This shows data provided by Jim Pitman from a class of 103 Berkeley 
undergraduates who were asked to (a) toss a penny 100 times and record head or 
tail, and (b) spin the same penny 100 times and record head or tail. A histogram of 
the data appears in Figure 7. The tossed coins are absolutely typical of fair coins, 
concentrated near 50 heads. The spun coins show a pronounced proportion of tails; 
several students had coins that came up fewer than 10% heads. For further discussion, 
see [33]. 

If a coin is flipped up and allowed to bounce on the floor, our observations suggest 
that some of the time it spins around a bit on its edge before coming to rest. If this is 
so, some of the strong edge spinning effect comes into play. There may be a real sense 
in which tossed coins landing on the floor are less fair than when caught in the hand. 
People often feel the other way. But we suggest that this is because coins caught in 
the hand are easier to manipulate. While this is clearly true, ruling out dishonesty, 
we stick to our conjecture. 

Throughout, we have neglected the possibility of coins landing on their edge. 
We make three remarks in this direction. First, as a youngster, the first author was 
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involved in settling a bet in which 10 coins were tossed in the air to land on the table 
(this to be repeated 1000 times). On one of the trials, one of the coins spun about 
and landed on its edge. An analysis of coins landing on edge has been developed by 
Murray and Teare [31]. Using a combination of theory and experiment they concluded 
that a U.S. nickel will land on its edge about 1 in 6000 tosses. Finally, Mosteller [30] 
developed tools to study the related question, “How thick must a coin be to have 
probability 1/3 of landing on edge?” 

In light of all the variations, it is natural to ask if inhomogeneity in the mass 
distribution of the coin can change the outcome. The papers [26] and [13] give informal 
arguments suggesting that inhomogeneity does not matter for flipped coins caught in 
the hand. Jaynes reported that 100 flips of a jar lid showed no evidence of bias. We 
had coins made with lead on one side and balsa wood on the other. Again no bias 
showed up. All of this changes drastically if inhomogeneous coins are spun on the 
table (they tend to land heavy side up). As explained above, some of this bias persists 
for coins flipped onto a table or floor. 

Coin tossing is such a familiar image that it seems that large amounts of empirical 
data should be available. A celebrated study [21] is based on a heroic collection of 
10,000 coin flips. Kerrich’s flips allowed the coin to bounce on the table so our analysis 
does not apply. His data do seem random (p = 1/2) for all practical purposes. Further 
studies of Pearson (24,000 flips) and Lock (30,000 flips) are reported in [33]. 

Our estimate of the bias for flipped coins is p = .51. To estimate p near 1/2 
with standard error 1/1000 requires Im = 1/1000 or n = 250,000 trials. While not 
beyond practical reach, especially if a national coin toss was arranged, this makes it 
less surprising that the present research has not been empirically tested. 


3. Rigid Body Motion. This section sets up notation, reviews needed mechanics, 
and proves extensions of Theorems 1 and 2. Before plunging into details, it may 
be useful to have the following geometric picture of a tumbling, flipping coin. Start 
the coin heads up, its normal vector N aligned with the up direction K. The initial 
velocities determine a fixed vector M (the angular momentum vector). Picture this 
riding along with the coin, centered at the coin’s center of gravity, staying in a fixed 
orientation with respect to the coordinates of the room. The normal to the coin stays 
at a fixed angle w to M and rotates around M at a fixed rate wy. At the same time, 
the coin spins in its plane about N at a fixed rate W pr. This description is derived in 
section 3.1. Theorem 1 is proved in section 3.2 allowing the initial configuration of the 
coin to be in a general position. The amount AA of precession during one revolution 
of N about M is related to the angle y by AA = rcos(w). This is made precise in 
two ways in section 3.3, where it is seen as a kind of “geometric Berry phase.” As 
described there, we found this a convenient way to empirically estimate the crucial 
angle w. 

This section uses the notation and development of [25, Chapter 6]. A more 
introductory account of the mysteries of angular momentum appears in [9, Vol. I, 
Chapters 18-20]. The classical account of [14] spells out many details. An advanced 
treatment appears in [28]. 


3.1. The Basic Setup. We model the flipping coin as a homogeneous, symmetric 
rigid body. Its center of gravity Re moves according to Rg(t) = (X(0) + tVz, Y (0) + 
tV,, Z(0) + tV; — gt?/2), where V = (Vz, Vj, V)! is its initial velocity, and where 
the last coordinate is in the direction of gravity. In order to describe the tumbling 
of the coin we use two coordinate systems, both centered at Rg. One has its axes 
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directions fixed relative to an inertial, or laboratory, frame. Coordinates and vectors 
in this system are denoted by capital letters. The second coordinate system has its 
axes rigidly attached to the coin and is called the body frame, and its coordinates 
and vectors are denoted by small letters. For example, the normal to the coin as 
viewed from the laboratory frame is N and depends on time, while the same normal, 
viewed in the body frame, is written 7 and is a constant vector along the z axis. The 
two frames are related by a rotation matrix I(t), which takes a vector in the body 
frame to the same point in the laboratory frame. Thus, for example, ['(0) = id and 
N(t) = r(t). For an introductory account of moving frames such as our body frame, 
see, for example, [10]. 

The instantaneous angular velocity Q(t) is a way of encoding the time derivative 
of T(t). It is a vector such that if X is pointing to a fixed material point on the coin, 
so that X(t) = T(t)Z with Z constant in the body frame, then 
(3.1) x =t) x X. 


a 


It will be important to have a picture of the general solution X(t) to (3.1) in the 
case where ((t) = 9 is constant. Then the projection of X(t) onto the line through Q 
is constant, while the projection of X (t) onto the perpendicular plane to Q traverses 
a circle at constant angular speed ||Q||. Putting these two motions together, X(t) 
sweeps out a cone whose tip travels the circle centered at the projection of X (0) onto 
the line through Q, with the plane of this circle perpendicular to this line. 

In general, (t) changes with t. There are exactly two cases where 1(t) is con- 
stant: the total cheat coin and the Keller coin, as discussed in section 1. Suppose 
that the coin starts heads up with the normal pointed up (in direction K ). For the 


> 


“total cheat coin,” Q(t) is vertical, in line with the normal to the coin, and its length 
is constant. Then, the coin remains horizontal for all time and spins about the normal 


> 


direction with some constant angular speed w (the length of Q(¢)). Equation (3.1) 
becomes 


dX Lou SS 

— =wK xX. 

dt 
For the “fair coin” analyzed by Keller, Q(t) lies in the plane of the coin. If its 
direction is along J, then (3.1) becomes ax =wI x X, where again w is the length of 


the (constant) vector Q(t). This story was told in section 2. In all other cases, Q(t) 
depends on t. 

The evolution of the angular velocity can be determined by using the conservation 
of angular momentum and the linear relation between angular momentum and angular 
velocity. While we will not need it explicitly, the angular momentum vector M for a 
rigid body may be defined as a sum over particles a in the body, 


(3.2) M = XO maRa x Va, 
a 
with R, the distance of the ath particle from the center of gravity, V, its velocity, and 


Ma its mass. Conservation of angular momentum asserts that M is constant during 
the flight of the coin. 
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The vectors M and Q(t) are related by a symmetric positive definite matrix called 
the moment of inertia tensor [25, section 32]. This matrix is constant relative to the 
body frame. Its eigenvalues are called the principal moments of inertia and denoted 
L, In, 13. Because of the coin’s symmetry, Jı = Ig and the eigenvectors for J4, Iz span 
the plane of the coin. Let €1,é@2 be a basis of this plane. The eigenvector for I3 is 
Ti, the normal to the coin. If the coin is modeled as a solid cylinder of thickness h, 
radius p, and uniform density, then [25, p. 102, eq. 2(c)] shows that 


1 1 1 
(3.3) h=h= i (me + T and I; = zP» 


where m is the total mass. Note that in the h | 0 limit (i.e., a very thin coin) 
I, ~ I3/2. The experiments described in section 5 use a U.S. half dollar, which has 


h = 2.15mm, p= 15.3mm, m = 11.34g. 
Thus J = Iz = 0.681g.m? and J3 = 1.327g.m?. 


Let w, m be the angular velocity and angular momentum in the body frame. Thus 
ü = T(t) İQ and m = T(t)-'M. Expanding them out in terms of the eigenframe 
ĉi, 9, n yields 


w = We} aa W2é2 + w3, 


= 


5 > > 
m = M11 +M2e2 + MZN. 


The components w; and m; will depend on t in general. Applying the moment of 
inertia tensor to œ yields ™, where 


m= lwi + Tow €o + Izwan 


3.4 
a = hö + U3 — h )wsñ, 


where we have used that Jı = I2. We now apply the rotation matrix T(t) relating the 
two frames to obtain 


M = Lo + (Iz — L)wz Ñ. 


Solving for Q gives 


(3.5) h h 
= wyM — wprN, 
where 
(3.6) wy = M/h, wpr = E = =) M cos(4), M = M/M, 


M = ||M|| is the magnitude of the angular momentum M, and we have used M3 = 
M cos(w) = Izw3. We note that wpr is positive since I) < I3. 

The vectors M and {(t) are parallel precisely at the extremes of the total cheat 
coin and the fair coin. In the total cheat case the components wı = w2 = 0, the 
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vectors M and Ñ are proportional and constant in time, and M = TQ, so that © is 
also constant in time. In the fair case, the component wg is zero, M=I 19, and again 
Q is constant in time. In all other cases, the angular momentum and velocity vectors 
are not parallel. The vector M stays constant in laboratory coordinates, while N and 
Q move in laboratory coordinates. 

From (3.1) and (3.4)-(3.6) we can read off the geometric description which intro- 
duced this section. Equation (3.1) applies to the vector X = Ñ since 7 is fixed in the 
coin frame. Thus 


Then, (3.5) along with Ñ x Ñ = 0 imply 
(3.7) —— = wry M xN. 


This equation asserts that the coin’s normal vector precesses about the axis M and 
that the angular frequency of precession is wy = M/I;, as in (3.6). The following 
section solves (3.7) and uses this to prove Theorem 1. 


3.2. A Generalization of Theorem |. In the inertial frame K is a unit vector 
in the up direction, M is the unit vector in the direction of the angular momentum, 
and N(t) is the unit normal to the head of the coin. The coin is “up” if K- N > 0 
and “down” (tails) if K-N <0. Theorem 1 assumes that the coin starts with heads 
perfectly up (K = Ñ at time 0). The following extensions allow an arbitrary start for 
N. 

THEOREM 1*. Let f(t) = N(t)-K be the quantity which determines “heads” or 
“tails,” i.e., the cosine of the angle between the normal N(t) to the coin at time t and 
the up direction K. Define Y, by 


cos(y) = 
cos(4) = 

then 
f(t) =A+ Bcos(wyt + 80), 


with A = cos Y% cos ¢, B = sin Ysin ġ, wy = M/I for Iı given by (3.3), and the phase 
Oo is determined by K - Ñ (0). 

Note that if N (0) = K so the coin starts heads “exactly” up, then ~ = ¢ and 
Theorem 1* yields Theorem 1. (We will see in the proof—look at the definition of Ĝi 
there—that in this case ĝo = 0 also.) 

We have presented Figure 8 in order to help interpret Theorem 1* and T heorem 
pias Figure 8 represents a sphere of possible unit vectors M, with the two points Ñ (0) 
and K indicated along with their corresponding great circles (“polars”), which are the 
loci of points where N(0) - M = 0 and where K - M =0. The shaded region between 
these great circles is the region where M must be located in order for the coefficient 
A to be negative, and consequently in this region (see Theorem 2*) the asymptotic 
bias is toward tails as opposed to heads. 
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Fig.8 The situation of Theorem 1*. N(0) and the vertical K need not be equal. When M is in the 
shaded region the asymptotic bias is toward tails. 


Proof. From (3.7) for any t > 0, cosy = N(0)- M= N(t)- M: the normal to the 
coin makes a constant angle with the angular momentum vector. Introduce a new 
orthonormal basis for space (G1, G2, M), with G, in the plane spanned by M and K, 
so that Gs is perpendicular to both M and R. T hen, from the definition of ¢, 


K= sin(¢)Gy + cos($)M. 


The general solution of the first order linear differential equation (3.7) for N(t) is 
N(t) = aM + 0(G,cos(wyt + 00) + Go sin(wyt + 09). The initial value of Ñ(0) - M 
determines a = cos(7),b = sin(w). (The value of ĝo can be determined from Ñ (0) - & 
if needed.) Altogether, 


N(t) = cos(w)M + sin(w){cos(wyt + 09)G1 + sin(wyt + 09)Go}. 


The expression for f(t) = N(t) - R follows from the orthonormality of (Gy, Go,M ). O 

This completes the proof of Theorem 1 as well. 

The analytic argument of section 4 shows that for vigorously flipped coins the 
argument wyt of f in Theorem 1* is asymptotically uniformly distributed (mod 27). 
The following theorem extends Theorem 2 to the case where the coin does not start 
with heads perfectly up. The setup is exactly the same as that preceding Theorem 1*. 
In addition, we let w and ¢ tend to infinity in the following way. For all a,b nonzero, 
we start with a compactly supported probability density g(w,t)dwdt, and translate 
it along a ray w/t = a/b by forming g*(w,t)dwdt = g(w — Aa,t — Ab)dwdt. Here a 
and b are fixed parameters which define the direction of the ray and  parameterizes 
the distance along the ray. Then we compute the probability p(q, ¢; A) of heads with 
respect to g*, given that we started with the angular momentum making an angle 
w to the normal and an angle ¢ to the up direction. Then we take the limit of this 
probability as A > +oo. Theorem 5 shows that this limit exists; we will call it p(w, ¢). 

THEOREM 2*. Let J, p, 0< Y,p < 1/2, be defined as in Theorem 1*. As w and t 
tend to infinity, let p(w, ) be the limiting probability of heads for a coin toss, starting 


with heads up, with angle w between the normal N(0) to the coin and the angular 
momentum M, and angle ¢ between the up direction K and M. Then 


al 


$+ sin~‘(cot(d)cot(W)) if (cot d)(cot y) < 1, 


p(y, 6) = 1 if (cot($)) cot(%) > 1. 
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Proof. According to Theorem 5 of the next section, the limiting probability 
distribution of the angle 0 = wyt is uniformly distributed on [0,27). We must then 
evaluate the probability that f(@) > 0, where @ is uniformly distributed and f is 
the function of Theorem 1*. We have f(@) = A+ Bcos(@) with A = cos(w) cos(¢), 
B =sin(w)sin(¢). If A > B, then f(0) > 0 for all 0 and p = 1. This happens if and 
only if cot(¢)-cot(¢) > 1. To compute in the case A < B observe that f is symmetric 
about 0 = 7 and is monotone decreasing on (0,7). It follows that f has a unique zero 
6; in (0,7), that f is positive on 0 < 0 < 04, and that f is negative on 6; < 0 <7. 
The uniform measure of the set {0 : f(0) > 0} is then 


pv, o) _ 01/7. 


The zero 6; occurs when cos(@) = —A/B. So 6; = cos~*(—A/B). Using cos ($ +h) = 
—sin(h) gives 0; = Z +sin7'(A/B). Finally, A/B = cot ġ cot y. o 
This completes the proof of Theorem 2 as well. 


3.3. Precession of the Head. As explained in the introduction to this section, 
while the normal to the coin is spinning about the angular momentum vector at rate 
wy, the coin is also spinning about the normal at a constant rate wpr. We now further 
quantify and verify that assertion. 

THEOREM 4. With notation as in (3.3), (3.6), each time the normal vector 
completes one full cycle around the angular momentum vector, the coin has precessed 
by the angle 


(3.8a) AA= ter 2r = —(1 — I /Is)2rcos(4) 
N 
(3.8b) ~ —meos(y) as hl 0. 


Remark. When w œ 0 so that M is nearly aligned with the vertical, we have 
AA ~r. In other words, every time the normal vector precesses around once, the 
coin rotates approximately 180°. Feynman observed this phenomenon in a Cornell 
dining hall. In his own words, 

“some guy, fooling around, throws a plate in the air”. By noticing the 
difference between the plate’s angular velocity and that of the associated 
wobble, says Feynman, he was motivated to higher things: “The diagrams 
and the whole business that I got the Nobel Prize for came from that 
piddling around with the wobbling plate.” [8] 


Remark. While the angular momentum and ~w are difficult to measure directly, 
the slow motion photography explained in section 5 often produced two frames where 
the coin clearly completed one revolution and the angle AA could be measured. From 
(3.8b), this gives . An example is shown in Figure 9. A second method of estimating 
w from photographs is explained in section 5.2 below. In this second method one 
reconstructs the normal’s time evolution and uses this to determine the radius (again 
w) of the resulting circle on the sphere. We have used four photographed tosses to 
check the first method. For toss no. 27, this method produced an estimate w of w. 
We observed y= = 1.48. For toss no. 30 we get y= = 1.47; for toss no. 32, y= = 1.40; 
and for toss no. 33, v= = 1.36. These are in close agreement with the first method of 
estimation. 

Proof. Our proof relies on Euler’s description of the motion of a rigid body, 
essentially the dual to (3.1) [25, sections 36, 36.5-36.7]. Let X be a vector which is 
fixed in space, and let 7 be the corresponding vector, viewed in the body frame. Then 
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FRAME 48 FRAME 68 FRAME 88 


Fig.9 Theorem 4 illustrated; these images are separated by exactly one coin flip. 


Euler asserted that d#/dt = —w(t) x Z. Euler’s equation is obtained by taking for X 
the angular momentum M, 


dm/dt = —w(t) x m. 
Since w = wy it — Wprři (see (3.5)) and since m x m = 0 Euler’s equation becomes 
(3.9) dit /dt = wprñ x m. 


To complete the proof, we have N(T) = N(0), where T = 2r /wy is the period of N’s 
precession. The evolution equation (3.9) for ™ asserts that the projection b of 17 onto 
the plane of the coin precesses at frequency wy, relative to a frame rigidly attached 
to the coin. (See the geometric description following (3.1).) Consequently, after time 
T the vector b has rotated by an amount wprT. In the meantime, in the lab frame 
the plane of the coin has returned to its original position and the vector M has not 
moved. But 5 is simply the projection of M , viewed relative to the coin’s frame. It 
follows that the coin’s frame has rotated by —wprT = AA about the normal in the 
same time. o 


4. Uniformity of Angular Distribution. The proof of Theorems 2 and 2* relies 
on the assertion that 0 = wynt, modulo 27, tends to the uniform distribution on the 
interval [0,27). It remains to establish this uniformity. The key is a theorem stating 
that if we take any “nice” real random variable X, rescale it, and view the result 
modulo 27, then as the rescaling tends to infinity, the distribution of the random 
variable becomes uniform. In symbols, AX mod 27 tends to the uniform distribution 
U on the interval [0,27) as A > oo. Such scaling theorems were used in Poincaré’s 
original treatment of roulette. A comprehensive treatment is in [5]. The main theorem 
of this section, Theorem 5 below, establishes a quantitative version of the needed 
uniformity. 

The product wyt(mod 27) depends on the initial conditions: 


wy = ||M||/ and t=t.=vz/(g/2). 
(See (3.3) and (3.6).) For vigorously flipped coins ||M|| and v, will be large. To 


formalize this “largeness” we suppose that the joint probability distribution of w = wy 
and t is of the form g(w — wo, t— to), where the product woto is large and where g(w, t) 
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is a smooth probability density (C+ is enough) with compact support. Note that wot 
is unitless and corresponds to the number of flips. Further, the expectation of w is of 
order wo, and that of t is of order to. 

The argument we are about to present is a variant of one due to Engel and Kem- 
perman and developed in Chapters 2 and 3 of [5], which we strongly recommend. For 
ease of notation and proof we abstract. Let (X),Y,) be a family of real-valued ran- 
dom variables parameterized by the ray 0 < A < oo. The following two assumptions 
are needed: 


(4.1) For some c > 0 there is a y > 0 so that for all À, 


P(Y) < Ac) < y/À. 


For each A and each fixed y, there is a regular conditional 
probability distribution for X) given Y) = y, with 
(4.2) differentiable density px, (x|Y) = y), satisfying 


[les lelt = vip (dy) = Ay < œ. 


Assumption (4.1) is a way of saying that Y) + oo with À. Assumption (4.2) is a way of 
saying that the marginal density of X) at x, which is f px, (<|Ya = y)py, (dy), is not 
too sharply peaked for any x. In our application À parameterizes a ray in the positive 
(w,t) orthant, and X} = wy and Y, = te are assumed to be distributed according to 
g(w — àwo, t — Ato). For g smooth with compact support, both assumptions (4.1) and 
(4.2) are satisfied with A, uniformly bounded. 

The statement of the next theorem involves the variation distance between two 
measures u,v. This distance is defined by 


dy(p,v) = sup |u(C) — o(C)| 
Cc 


-;/|Z-< 
2 do do 


with the sup over all measurable sets C’. In the second equality o is any measure which 
dominates both u and v (for example, o = u +v). Note that this second equality 
is independent of o. If X and Y are random variables with u(C) = P(X € ©), 
v(C) = P(Y € C) we write d,(X,Y) for d,(u,v). A careful treatment of variation 
distance is in [5]. 

THEOREM 5. Let (X),Y) be a family of real-valued random variables satisfying 
(4.1), (4.2). Let U be a uniform random variable taking values on the interval [0, 27]. 
Then, for all A, 


do, 


dy(X,Y)(mod 27),U) < 


+ 


+|R 


M 
4c 
Proof. 

dy((X,¥Y,mod 27),U) < J eo(Xxu(moa 2m),U) Py, (dy) 


< PAY, <A} + | dy (X,y(mod 2n),U)Py, (dy) 
{Y\>cA} 


Y\>cdA} 4y 


T 
ee J i Ine, (al¥a = y) de Py, (dy) 
y T 

Baie A 

x A 
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The first inequality follows from Proposition 2.8a of [5]. This says that if X and Y 
are real-valued random variables and Z is a random variable defined on the same 
probability space, then 


dy(X,Y) < | dy(X*,Y)P.(dz), 


with X* the random variable conditional on Z = z. The second inequality uses dy < 1. 
The third inequality uses (4.1) and Theorem 3.3 of [5], the theorem mentioned in the 
first paragraph of the present section. This theorem states that for a real random 
variable X with differentiable density p(x), 


dy(tX(mod 27), U) < T f W'@lae. 
O 


The last inequality uses assumption (4.2). 

Remarks. 

(1) The roles of X),Y) can be reversed and the minimum of the two resulting 
bounds used. 

(2) The rate of convergence, order 1/A, is the best possible. However, there are 
classes of examples where the convergence in Theorem 5 is exponential in A 
or even faster. See [5, Chapter 3] for the full range of possibilities. 

(3) In the present application the convergence of X)Y,(mod 27) to U is used 
in conjunction with f(0) of Theorem 1* to obtain Theorem 2. Since f is a 
bounded continuous function, weak-star convergence to U will do. [5] shows 
that AX (mod 27) converges weakly to U if and only if the Fourier coefficients 
of X (mod 27) converge to zero. Thus densities such as g(w, t) are not required 
for Theorem 2 to hold. 


5. Estimating Angular Momentum. The empirical distribution of the angle w 
between the angular momentum vector M and the normal to the coin figures crucially 
in our analysis of coin tossing. In this section we describe our efforts at estimating 
this distribution for real flips of a U.S. half dollar. 

We carried out two types of experiments. Our “low-tech” experiment involved 
flipping a coin with a ribbon attached. After the flip the ribbon is unwound, showing 
how many times the coin rotated. This is described in section 5.1. Our “high-tech” 
experiment involved a high-speed camera and the mathematics of section 3 to estimate 
the sequence of normals to the coin at different times during the coin’s flight. These 
lie on a circle centered at the angular momentum vector. The radius of this circle 
gives an estimate of ~. These results are described in section 5.2. This estimate of 
yw can be calibrated with the measured value of rotation of the coin’s face using the 
relation between this angle and w described in (3.8). 


5.1. Coin and Ribbon. A 1/8-inch wide 29-inch long ribbon was attached to a 
U.S. half dollar with Scotch tape. For each flip, the ribbon was flattened (untwisted), 
with the end held in the left hand. The coin was positioned heads up, in a fixed 
orientation in a fixed coordinate system marked on a table. The coin was given a 
normal flip with the right hand, caught without bouncing at approximately the same 
height. For all flips that resulted in heads up, two numbers were recorded: 

e The angle 0 that a fixed point on the coin’s head made with its original 
starting orientation. Here —180° < 0 < 180° with 180° and —180° indistin- 
guishable. The angle was measured to the nearest 5°. 
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Fig. 10 Scatterplot of ribbon data. 


e The number f of complete flips of the coin, determined by unwinding the 
ribbon until it was untwisted. 

Pairs (0, f) were recorded for 100 flips (1 < i < 100). A scatterplot of these values 
appears in Figure 10. There are three noticeable features: 

1. Four of 100 flips have f = 0 (the coin never turned). These were vigorous 
flips with 0 ranging widely. This indicates that the angular momentum vector 
makes angle at most 45° with the normal with probability roughly 1/25. Re- 
enforcing this, 3 of 100 flips have f = 1. 

2. The angles 0 vary widely. They have minimum of —170°, lower quartile —1.3°; 
median and mean about 30°, upper quartile of 55°; and maximum of 180°. If 
the angular momentum vector was in the plane of the coin (Yy = 7/2), then 
all of these 6; would be zero. 

3. The (6, f) pairs seem independent of each other (corr = —.2). Indeed, the 
distribution of 0 seems roughly uniform. One explanation is that the marked 
point on the coin makes several turns around. 

All three observations reinforce the idea that typical coin flips often have the angle w 
far from 7/2. 


5.2. Slow Motion Photography. We used a high-speed slow motion camera to 
record 50 coin flips. The camera, developed by Stanford’s digital photo program, 
shoots at up to 1400 frames per second (see [6], [23]). We found it best to film 
at about 600 frames per second. In contrast, the slow motion feature on standard 
camcorders shoots at about 60 frames per second which is much too slow to give any 
useful data. 

The data collection and processing led to interesting, difficult problems. Briefly, 
our camera gave about 100 frames per flip. At 600 frames per second, this gives a 
window of 1/6 seconds to record. We found careful effort is required to start the 
filming so that enough of the flip was recorded. 

A successful filming results in up to 100 two-dimensional images. As explained 
below, a circular disc projected onto a plane results in an ellipse (see Figure 11). We 
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Fig. II The coin is projected onto an ellipse in the camera plane. 


Fig. 12 Fit of two ellipses to the image. 


painted the coin with two colors, black outside and a white disk painted half way into 
the coin (Figure 9). These two overlapping circles enabled us to check the camera 
calibration using [11] and [39]. See Figure 12. 

To accurately fit the ellipse, we used MATLAB’s image analysis software to 
threshold each image and detect the edge of the coin. This provided a set of points 
approximately situated on the ellipse. Next, the least squares fit to the ellipse was 
determined using the approach of [3] adapted by [12]. These steps resulted in a fitted 
ellipse for each image (see Figure 12). 

From each ellipse the major and minor axes were determined. From these, as 
described below, the normal to the coin in three dimensions can be estimated. As a 
check, we superimposed our fitted ellipses and normals on the sequence of images and 
viewed the resulting movie. The fits seemed consistently good although not perfect; 
particular difficulties are encountered when the coin is close to edge on. Then, the 
thickness of the coin’s edge becomes an important factor. The sequence of (a) coin 
images, and (b) coins with fitted ellipses can be viewed by the reader at http://www- 
stat.stanford.edu/~susan/coins/. 

At this stage, for each flip, we have a sequence of fitted normal vectors in three 
dimensions, centered at the coin’s center of gravity. According to the theory of section 
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3, these normals lie on a circle centered at the fixed angular momentum vector. The 
radius of this circle thus gives an estimate of the angle w associated with the flip. Of 
course, the circles can be fit from just a few points. We used about 20 points/flip and 
again checked visually to see if these looked as if they lay on a circle. Some surprising 
results are described following the basic normal fitting algorithm which is described 
next. 

The plane of the camera is fixed throughout. In spatial coordinates (X1, X2, X3), 
the (X1, X2,0) plane will be identified with the camera plane and the line (0,0, X3) is 
the orthogonal to the camera plane. At a fixed time, the coin is in a fixed position in 
three-space. The projection of a circle or disc onto a plane is an ellipse. We observe, 
and can accurately estimate, the major and minor axes of this ellipse. The assumption 
of orthographic projection implies that the length of the major axis is the same as the 
length of the diameter of the coin. Without loss of generality we assume the coin has 
radius 1. Let A= (Aj, Ag, 0) be a unit vector in the plane of the camera centered at 
the ellipse, center along the major axis. Let B= (Bı, B2,0) be an orthogonal vector 
along the minor axis. Thus |A| = 1 and |B| = cosé@ for some angle 0, 0 < @ < 7/2. 
This description of A,B involves a choice of + sign which we will deal with in a 
moment. 

Throughout, we assume that coordinates have been chosen so that the center of 
the coin is at the center of the ellipse. 

Let U i V be the unit vectors on the coin, which project to A, B , respectively, so 
that in fact A =U. Let K = (0,0, 1) be the direction orthogonal to the camera plane. 

LEMMA 1. With notation as above the normal Ñ to the coin is 


N = (a4: 1— (B? + B3), E€&2Á1 1— (B? + B3), ce3( A1 Bo =, AaB1)) 


for some choice of signs ei = +1. 

Proof. Any vector projecting onto B has the form V = B+ AK. Since our V 
is a unit vector and |B] = cos(@), |A| = sin@ is forced. Thus Ñ = Ax V = A x 
(B+ esin 0K). Expanding out the cross product and using sin 0 = +,/1 — (B? + B2) 
yields the result. Oo 

Remark. The sign ambiguity creates a serious practical problem. If the coin 
is parallel to the plane of the camera or edge onto the camera, the sign is difficult 
to determine (the projections in these cases are either perfect circles or a segment). 
While these events happen rarely with a tumbling coin, they do occur periodically 
and an arbitrary choice leads to physically ridiculous pictures. We resolved the choice 
of signs by continuity, choosing (at each time frame) among eight sign patterns that 
makes the inner product of the current normal and previous normal as close as possible 
to a constant, while keeping the curvature of the curve of normals continuous. 

Figure 13(b) shows the result of this unscrambling process. In Figure 13(a) we 
see the sequence of three-dimensional normals as they come from the computation of 
normals one by one from the ellipses. Successive normals are numbered in sequence. 
There are some approximately circular arcs; however, the result is a mess. In Fig- 
ure 13(b) we see that the unscrambled normals, unscrambled by appropriate choice 
of signs, all lie clearly around the circle. Our theory implies that these points lie in a 
plane in three-dimensional space. We fit the plane using least squares. We compute 
the distance d between the plane and the origin from which we can find w = cos~'(d). 


5.3. The Results. Of our 50 flips, 27 gave useful final results. From the measured 
values of y, the probability p(w) was calculated from Theorem 2. The estimated prob- 
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Fig. 13 (a) The normals originally have a scrambled sign pattern. (b) How the normal vectors 
(once unscrambled) sit around a circle in three dimensions. 


abilities range from 0.500 to 0.545. The 27 probabilities are displayed in a stem and 
leaf plot in Table 1. The first row of this plot shows the values 0.500, 0.500, 0.501,... 
indicating occurrences of flips for which p(w) took on these values. The next-to-last 
row shows no occurrences between 0.540 and 0.545. The last row shows the single 
outlying value 0.545. The median and standard deviation are 0.5027 and 0.0125. The 
mean of these probabilities is 0.508, and we have rounded this up to the 0.51 quoted. 
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Table | Stem and leaf plot of the estimated probabilities. 


50 | 001111111222333334 
50 | 555 

51 | 3 

51 

52 | 3 

52 | 9 

53 | 34 

53 

54 

54 | 5 


Table 2. Estimates of y of Theorem 2. 


1.3697 1.2125 1.4682 1.2778 1.3103 
1.2611 1.5528 1.4478 1.5211 1.5182 
1.5100 1.2560 1.4797 1.4829 1.4705 
1.5228 1.4696 1.5114 1.5102 1.4983 
1.4962 1.4603 1.5176 1.4408 1.4519 
1.5489 1.4800 


For completeness, Table 2 shows the w values; these have mean 1.446 and standard 


deviation 0.097. Note that if z denotes the mean of z, then p(y) 4 p(w). 


6. Some Caveats to the Analysis. In carrying out the present analysis we make 
a number of assumptions. In this section we point out naturally occurring situations 
where our assumptions are violated, and hence our analysis need not apply. 


6.1. Random In, Random Out. We have assumed that the coin is flipped with 
a known side uppermost. In many occurrences of coin tossing a coin is removed from 
the pocket and hence may be assumed as equally likely to start heads up as tails up. 
The physics preserves this: the outcome is as equally likely to end heads up as tails 
up. We have friends who preface a coin toss by vigorously shaking the coin between 
their cupped hands. 


6.2. No Air Resistance. Throughout, we have neglected the effect of air resis- 
tance. Let us begin by acknowledging that air resistance is a potential confounding 
factor. Some friends in the physics department convinced us of this by dropping a 
U.S. penny off Stanford’s Hoover Tower. Some of the time it fell like a leaf, flutter- 
ing to the ground. We believe that for our short flips, air resistance has a negligible 
effect. One way to test this is to observe the time tı that a coin takes to go from 
its start to the top of its trajectory and then the time tə that the coin takes to fall 
back to its initial starting height. As discussed in [4] air friction forces tə > tı. We 
expect the size of this effect to be very small and an accurate measurement of it to 
be very difficult. Zeng-Yuan and Bin [38] included air resistance in their analysis of 
coin tossing, presuming the effect is proportional to velocity. In the limit t —> oo of 
Theorems 2 and 2* air resistance could be nonnegligible. 

Careful modeling of a spinning coin in a retarding medium seems like a difficult 
problem. Even determining an appropriate approximation to the effect of friction on 
velocity is a contentious matter. Long and Weiss [27] discussed the classical assump- 
tion (frictional force proportional to velocity) in detail and argued that velocity raised 
to powers such as 3/2 or 2 seems more appropriate. 
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We do not believe that air resistance is a problem for naturally flipped coins, but 
as a referee pointed out, it probably is a problem in the large spin, large time of flight 
limit. This is one of the reasons we gave quantitative nonlimiting results in section 4. 


6.3. Definition of Time in Flight. We have used two different notions of the 
time te that the coin comes to rest (sections 3.1, 5.2). We have not incorporated the 
very real possibility that the time te has extra randomness due to the catcher’s hand 
moving, or a psychological component due to the coin being caught early or late in its 
flight. We do not think that these variations in te materially affect our conclusions. 
The reason is that flipped coins (starting from heads up) simply spend more of their 
total time in flight heads up. Figure 4 shows this clearly. If the coin were caught 
at a completely random time (say, uniformly chosen in a large interval), our analysis 
would still apply. 


6.4. Start Heads Up? Perhaps the most vulnerable assumption is the specifi- 
cation of the starting normal direction as heads exactly up. Careful observation of 
natural flips shows some play in the initial position due to the position of the hand and 
thumb. We have dealt with this mathematically in section 3.2 but have not gathered 
data. Of course, this point is closely connected to point 6.1 above. 


6.5. No Bouncing. We have assumed that the coin is caught without bouncing. 
While this is a very common occurrence, we note that often a flipped coin is caught 
in the hand and then slapped down on the table or on the back of the other hand, 
turning it over once. Of course, this last results in a bias opposite to the start. If the 
method of catching is not determined (sometimes flipped over, sometimes as fallen), 
a significant amount of randomness may be added. 

Often, flipped coins are allowed to land on hard surfaces and bouncing occurs. 
This requires a different type of analysis. Some further discussion and references are 
in section 2. 


6.6. The Pragmatic Uncertainty Principle. The measurements recorded in sec- 
tion 5 may have affected the outcomes recorded. For example, attaching a ribbon 
to the coin changes the aerodynamics of its flight. To check this, we recorded the 
angle 0 for 100 flips of a half dollar without the ribbon attached. The results were 
still widely spread out but now the lower, median, and upper quantiles were shifted 
to —40°,0°,30°. Similarly, getting the slow motion camera in sync with our flips 
required fairly artful flips. These seem quite different from the more vigorous flips 
one observes at sporting events. We hope to develop a setup with multiple low-speed 
cameras that allow measurement of more natural flips. 


7. Conclusion. Despite these important caveats we consider the bias we have 
found fascinating. The discussion also highlights the true difficulty of carefully study- 
ing random phenomena. If we have this much trouble analyzing a common coin 
toss, the reader can imagine the difficulty we have with interpreting typical stochastic 
assumptions in an econometric analysis. 

The caveats and analysis also point to the following conclusion: Keller’s analysis 
gives a good approximation for tossed coins. To detect the departures of the order of 
magnitude we have found would require 250,000 tosses. The classical assumptions of 
independence with probability 1/2 are pretty solid. 
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